
A separability index for clustering and classification problems with applications to cluster merging and systematic evaluation of clustering algorithms



Abstract

A separability index quantifying the degree of difficulty in a hard clustering problem is proposed under the assumption of a multivariate Gaussian distribution for each group. We first define a preliminary index and explore its properties both theoretically and numerically. Adjustments are then made to this index so that the final refinement is also interpretable in terms of the Adjusted Rand Index between a true grouping and its hypothetical idealized clustering, taken as a surrogate of clustering complexity. Our derived index is used to develop a data-simulation algorithm that generates samples according to a prescribed value of the index. This algorithm is particularly useful for systematically generating datasets with varying degrees of clustering difficulty, which we use to evaluate the performance of different clustering algorithms. The index is also shown to be useful in providing a summary of the distinctiveness of classes in grouped datasets.
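The link between group separation and the Adjusted Rand Index can be illustrated with a minimal Python sketch. It does not implement the index proposed in the thesis, whose definition is not given in the abstract. Everything here is an assumption made for illustration: two equal-sized spherical Gaussian groups with identity covariance, the helper names `adjusted_rand_index`, `simulate_two_groups`, and `idealized_clustering`, and the reading of "idealized clustering" as assigning each point to the group whose true density is highest.

```python
import numpy as np
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand Index computed from the contingency table of two labelings."""
    a = np.asarray(labels_a)
    b = np.asarray(labels_b)
    _, a_idx = np.unique(a, return_inverse=True)
    _, b_idx = np.unique(b, return_inverse=True)
    table = np.zeros((a_idx.max() + 1, b_idx.max() + 1), dtype=int)
    np.add.at(table, (a_idx, b_idx), 1)  # count co-occurrences of label pairs

    sum_cells = sum(comb(int(n), 2) for n in table.ravel())
    sum_rows = sum(comb(int(n), 2) for n in table.sum(axis=1))
    sum_cols = sum(comb(int(n), 2) for n in table.sum(axis=0))
    total_pairs = comb(len(a), 2)

    expected = sum_rows * sum_cols / total_pairs
    max_index = 0.5 * (sum_rows + sum_cols)
    if max_index == expected:  # degenerate case, e.g. a single group in both labelings
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


def simulate_two_groups(distance, n_per_group=500, dim=2, seed=0):
    """Hypothetical setup: two spherical Gaussian groups whose means are `distance` apart."""
    rng = np.random.default_rng(seed)
    mean_b = np.zeros(dim)
    mean_b[0] = distance
    x = np.vstack([
        rng.standard_normal((n_per_group, dim)),
        rng.standard_normal((n_per_group, dim)) + mean_b,
    ])
    y = np.repeat([0, 1], n_per_group)
    return x, y, mean_b


def idealized_clustering(x, mean_b):
    """Assign each point to the group with the highest true density.

    With equal group sizes and identity covariances (assumed here),
    this reduces to picking the nearest true mean.
    """
    d0 = np.sum(x ** 2, axis=1)
    d1 = np.sum((x - mean_b) ** 2, axis=1)
    return (d1 < d0).astype(int)


for distance in (0.5, 1.0, 2.0, 4.0, 6.0):
    x, y, mean_b = simulate_two_groups(distance)
    ari = adjusted_rand_index(y, idealized_clustering(x, mean_b))
    print(f"distance={distance:.1f}  ARI={ari:.3f}")
```

In this toy setting the Adjusted Rand Index rises toward 1 as the groups move apart, which is the sense in which it can serve as a surrogate for clustering difficulty. The distance between means is used here only as a stand-in for a separability measure.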

Bibliographic details

  • Author

    Peterson, Anna Dagmar;

  • Author affiliation
  • Year 2011
  • Total pages
  • Original format PDF
  • Language en
  • Chinese Library Classification
